智能论文笔记

Integrating connection search in graph queries

Angelos Christos Anadiotis , Ioana Manolescu , Madhulika Mohanty

分类：人工智能

2022-08-09

图数据管理和查询具有许多实际应用。当图形非常异构和/或用户不熟悉其结构时，即使用户无法描述连接，他们也可能需要找到如何在图中连接两个或多个节点的组。这仅由现有查询语言部分支持，这些语言允许搜索路径，但不适合连接三个或更多节点组的树。后者与NP-HARD组Steiner树问题有关，以前已考虑用于数据库中的关键字搜索。在这项工作中，我们正式展示了如何在诸如SPARQL或Cypher之类的图形语言中集成连接的树模式（CTPS，简称CTP），从而导致扩展查询语言（或简而言之）。然后，我们研究一组评估CTP的算法；我们概括了先前的关键字搜索工作，最重要的是（i）考虑双向边缘遍历遍历和（ii）允许用户选择任何分数功能来排名CTP结果。为了应对非常大的搜索空间，我们提出了一种有效的修剪技术，并正式建立了大量的情况，即使我们的算法molesp也可以完成修剪。我们的实验验证了我们在大量合成和现实世界中的CTP和EQL评估算法的性能。

translated by 谷歌翻译

Accu-Help: A Machine Learning based Smart Healthcare Framework for Accurate Detection of Obsessive Compulsive Disorder

Kabita Patel , Ajaya Kumar Tripathy , Laxmi Narayan Padhy , Sujita Kumar Kar , Susanta Kumar Padhy , Saraju Prasad Mohanty

分类：机器学习

2022-12-05

In recent years the importance of Smart Healthcare cannot be overstated. The current work proposed to expand the state-of-art of smart healthcare in integrating solutions for Obsessive Compulsive Disorder (OCD). Identification of OCD from oxidative stress biomarkers (OSBs) using machine learning is an important development in the study of OCD. However, this process involves the collection of OCD class labels from hospitals, collection of corresponding OSBs from biochemical laboratories, integrated and labeled dataset creation, use of suitable machine learning algorithm for designing OCD prediction model, and making these prediction models available for different biochemical laboratories for OCD prediction for unlabeled OSBs. Further, from time to time, with significant growth in the volume of the dataset with labeled samples, redesigning the prediction model is required for further use. The whole process requires distributed data collection, data integration, coordination between the hospital and biochemical laboratory, dynamic machine learning OCD prediction mode design using a suitable machine learning algorithm, and making the machine learning model available for the biochemical laboratories. Keeping all these things in mind, Accu-Help a fully automated, smart, and accurate OCD detection conceptual model is proposed to help the biochemical laboratories for efficient detection of OCD from OSBs. OSBs are classified into three classes: Healthy Individual (HI), OCD Affected Individual (OAI), and Genetically Affected Individual (GAI). The main component of this proposed framework is the machine learning OCD prediction model design. In this Accu-Help, a neural network-based approach is presented with an OCD prediction accuracy of 86 percent.

translated by 谷歌翻译

DistGNN-MB: Distributed Large-Scale Graph Neural Network Training on x86 via Minibatch Sampling

Md Vasimuddin , Ramanarayan Mohanty , Sanchit Misra , Sasikanth Avancha

分类：机器学习

2022-11-11

Training Graph Neural Networks, on graphs containing billions of vertices and edges, at scale using minibatch sampling poses a key challenge: strong-scaling graphs and training examples results in lower compute and higher communication volume and potential performance loss. DistGNN-MB employs a novel Historical Embedding Cache combined with compute-communication overlap to address this challenge. On a 32-node (64-socket) cluster of $3^{rd}$ generation Intel Xeon Scalable Processors with 36 cores per socket, DistGNN-MB trains 3-layer GraphSAGE and GAT models on OGBN-Papers100M to convergence with epoch times of 2 seconds and 4.9 seconds, respectively, on 32 compute nodes. At this scale, DistGNN-MB trains GraphSAGE 5.2x faster than the widely-used DistDGL. DistGNN-MB trains GraphSAGE and GAT 10x and 17.2x faster, respectively, as compute nodes scale from 2 to 32.

translated by 谷歌翻译

Atrial Fibrillation Recurrence Risk Prediction from 12-lead ECG Recorded Pre- and Post-Ablation Procedure

Eran Zvuloni , Sheina Gendelman , Sanghamitra Mohanty , Jason Lewen , Andrea Natale , Joachim A. Behar

分类：机器学习

2022-08-22

简介：在房颤（AF）导管消融过程（CAP）期间记录了12条铅心电图（ECG）。如果没有长时间的随访评估AF复发（AFR），确定CAP是否成功并不容易。因此，AFR风险预测算法可以使CAP患者更好地管理。在这项研究中，我们从CAP前后记录的12铅ECG中提取功能，并训练AFR风险预测机学习模型。方法：从112例患者中提取前和后段段。该分析包括信号质量标准，心率变异性和由12铅ECG设计的形态生物标志物（总体804个功能）。在112名患者中，有43例AFR临床终点可用。这些用于使用前或后CAP特征来评估AFR风险预测的可行性。在嵌套的交叉验证框架内训练了一个随机的森林分类器。结果：发现36个特征在区分手术前和手术后具有统计学意义（n = 112）。对于分类，报告了接收器操作特性（AUROC）曲线下的区域，AUROC_PRE = 0.64，AUROC_POST = 0.74（n = 43）。讨论和结论：此初步分析表明AFR风险预测的可行性。这样的模型可用于改善盖帽管理。

translated by 谷歌翻译

FRIDA -- Generative Feature Replay for Incremental Domain Adaptation

Sayan Rakshit , Anwesh Mohanty , Ruchika Chavhan , Biplab Banerjee , Gemma Roig , Subhasis Chaudhuri

分类：计算机视觉 | 机器学习

2021-12-28

我们在本文中解决了增量无监督域适应（IDA）的新问题。我们假设标记的源域和不同的未标记的目标域通过约束逐步观察到与当前域的数据仅一次可用。目标是为当前域概括为所有过去域的准确性。 IDA设置因域之间的突然差异以及包括源域内的过去数据的不可用。受到生成功能重放的概念的启发，我们提出了一种名为特征重放的增量域适应（Frida）的新颖框架，它利用了一个名为域 - 通用辅助分类GaN（DGAC-GaN）的新的增量生成对抗性网络（GAN）来生产域明确的特征表示无缝。对于域对齐，我们提出了一种简单的扩展名为Dann-Ib的流行域对抗神经网络（Dann），鼓励歧视域 - 不变和任务相关的特征学习。 Office-Home，Office-Caltech和Domainnet数据集的实验结果证实，FIDA维护了卓越的稳定性可塑性权衡，而不是文献。

translated by 谷歌翻译

Emotions are Subtle: Learning Sentiment Based Text Representations Using Contrastive Learning

Ipsita Mohanty , Ankit Goyal , Alex Dotterweich

分类：自然语言处理

2021-12-02

对比的学习技术已广泛用于计算机视野中作为增强数据集的手段。在本文中，我们将这些对比学习嵌入的使用扩展到情绪分析任务，并证明了对这些嵌入的微调在基于BERT的嵌入物上的微调方面提供了改进，以在评估时实现更高的基准。在Dynasent DataSet上。我们还探讨了我们的微调模型在跨域基准数据集上执行的。此外，我们探索了ups采样技术，以实现更平衡的班级分发，以进一步改进我们的基准任务。

translated by 谷歌翻译

Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning & HPC Workloads

Evangelos Georganas , Dhiraj Kalamkar , Sasikanth Avancha , Menachem Adelman , Deepti Aggarwal , Cristina Anderson , Alexander Breuer , Jeremy Bruestle , Narendra Chaudhary , Abhisek Kundu

分类：人工智能

2021-04-12

在过去十年中，已经开发出新的深度学习（DL）算法，工作负载和硬件来解决各种问题。尽管工作量和硬件生态系统的进步，DL系统的编程方法是停滞不前的。 DL工作负载从DL库中的高度优化，特定于平台和不灵活的内核，或者在新颖的操作员的情况下，通过具有强大性能的DL框架基元建立参考实现。这项工作介绍了Tensor加工基元（TPP），一个编程抽象，用于高效的DL工作负载的高效，便携式实现。 TPPS定义了一组紧凑而多才多艺的2D张镜操作员（或虚拟张量ISA），随后可以用作构建块，以在高维张量上构建复杂的运算符。 TPP规范是平台 - 不可行的，因此通过TPPS表示的代码是便携式的，而TPP实现是高度优化的，并且特定于平台。我们展示了我们使用独立内核和端到端DL＆HPC工作负载完全通过TPPS表达的方法的效力和生存性，这在多个平台上优于最先进的实现。

translated by 谷歌翻译

The Multi-Agent Behavior Dataset: Mouse Dyadic Social Interactions

Jennifer J. Sun , Tomomi Karigo , Dipam Chakraborty , Sharada P. Mohanty , Benjamin Wild , Quan Sun , Chen Chen , David J. Anderson , Pietro Perona , Yisong Yue

分类：机器学习 | 计算机视觉

2021-04-06

多代理行为建模旨在了解代理之间发生的交互。我们从行为神经科学，Caltech鼠标社交交互（CALMS21）数据集中提供了一个多代理数据集。我们的数据集由社交交互的轨迹数据组成，从标准居民入侵者测定中自由行为小鼠的视频记录。为了帮助加速行为研究，CALMS21数据集提供基准，以评估三种设置中自动行为分类方法的性能：（1）用于培训由单个注释器的所有注释，（2）用于风格转移以进行学习互动在特定有限培训数据的新行为学习的行为定义和（3）的注释差异。 DataSet由600万个未标记的追踪姿势的交互小鼠组成，以及超过100万帧，具有跟踪的姿势和相应的帧级行为注释。我们的数据集的挑战是能够使用标记和未标记的跟踪数据准确地对行为进行分类，以及能够概括新设置。

translated by 谷歌翻译